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ABSTRACT 

We investigate how the shape of the galaxy two-point correlation function as measured in 
the zCOSMOS survey depends on local environment, quantified in terms of the density con- 
trast on scales of 5 h _1 Mpc. We show that the flat shape previously observed at redshifts be- 
tween z — 0.6 and z — \ can be explained by this volume being simply 10% over-abundant in 
high-density environments, with respect to a Universal density probability distribution func- 
tion. When galaxies corresponding to the top 10% tail of the distribution are excluded, the 
measured w p (r p ) steepens and becomes indistinguishable from AC DM predictions on all 
scales. This is the same effect recognised by Abbas & Sheth in the SDSS data at z ~ and ex- 
plained as a natural consequence of halo-environment correlations in a hierarchical scenario. 
Galaxies living in high-density regions trace dark matter halos with typically higher masses, 
which are more correlated. If the density probability distribution function of the sample is 
particularly rich in high-density regions because of the variance introduced by its finite size. 
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1 INTRODUCTION 

Advances in the spectroscopic survey capabilities of 8-meter class 
telescopes have allowed us in the recent years to extend detailed 
studies of the clustering of galaxies to the z ~ 1 Universe (Coil 
et al. 2004; Le Fevre et al. 2005; Polio et al. 2006; Coil et al. 2006; 
Meneux et al. 2006; de la Torre et al. 2007; Meneux et al. 2008; 
Coil et al. 2008; Abbas et al. 2010). The most recent contribution 
to this endeavour is the COSMOS survey (Scoville et al. 2007), and 
in particular zCOSMOS, its redshift follow-up with VIMOS at the 
ESO-VLT (Lilly et al. 2007). 

Early angular studies of the COSMOS field (McCracken et al. 
2007) and more recent analyses of the first ten thousand zCOS- 
MOS redshifts to Iab = 22.5, have evidenced significant "ex- 
cess" clustering in the large-scale shape of the two-point angular 
and projected correlation function. The redshift information from 
zCOSMOS, in particular, shows this excess to dominate in the red- 
shift range 0.5 < z < 1 (Meneux et al. 2009). More precisely, the 
shape of the projected two-point correlation function w p (r p ) ap- 
pears to decay much less rapidly than observed at similar redshifts 
in independent data as the VVDS survey (Meneux et al. 2008) and 
with respect to predictions of standard ACDM cosmology as in- 
carnated by the Millennium simulation (De Lucia & Blaizot 2007; 
Kitzbichler & White 2007). The observed flat shape 1 is difficult to 
reconcile with the theory, unless an unrealistic scale-dependent bias 
between galaxies and matter is advocated. While plausibly related 
to the presence of particularly rich large-scale structures dominat- 
ing the COSMOS volume around z ~ 0.7 (e.g. Meneux et al. 2009; 
), this effect still awaits a quantitative explanation. 

In a recent series of papers, Abbas & Sheth (2005, 2006, 2007) 
have used the Sloan Digital Sky Survey (SDSS, York et al. 2000), 
together with Halo Occupation Distribution models (HOD, e.g. 
Cooray & Sheth 2002) to show how in general the amplitude and 
shape of the galaxy correlation function depend on the environment 
in which the galaxies are found. Once a local density is suitably de- 
fined over a given scale, galaxies living in over-dense regions show 
a stronger clustering than those in average or under-dense environ- 
ments. This is shown to be a consequence of the direct correlation 
arising in hierarchical clustering between the mass of the dark mat- 
ter halos in which galaxies are embedded, and their large-scale en- 
vironment: the mass function of dark-matter halos is top-heavy in 
high-density regions, thus selecting galaxies in these environments 
we are selecting halos of higher mass, which are more clustered. 
The net result is to introduce a scale-dependent bias in the observed 
correlation function, when this is compared to the expected dark- 
matter clustering (Abbas & Sheth 2006, 2007). 

In this Letter we investigate whether this effect is at work also 
at z ~ 0.7 and could explain quantitatively the observed shape of 
w p (r p ) in the zCOSMOS data. 



2 DATA AND METHODS 

2.1 The zCOSMOS lOk-bright sample 

zCOSMOS is a large spectroscopic survey performed with the Vis- 
ible Multi-Object Spectrograph (VIMOS, Le Fevre et al. 2003) at 
the ESO-VLT. The zCOSMOS-bright survey (Lilly et al. 2007) has 
been designed to follow-up spectroscopically the entire 1.7 deg 2 



COSMOS-ACS field (Scoville et al. 2007; Koekemoer et al. 2007) 
down to Iab — 22.5. We use in this analysis the first-epoch set of 
redshifts, usually referred to as the zCOSMOS lOk-bright sample 
("10k sample", hereafter), including 10, 644 galaxies. At this mag- 
nitude limit, the survey redshift distribution peaks at z ~ 0.6, with 
a tail out to z ~ 1.2. We only consider secure redshifts, i.e. confi- 
dence classes 4.x, 3.x, 9.3, 9.5, 2.4, 2.5, and 1.5, representing 88% 
of the full 10k sample (see Lilly et al. 2009, for details) and 20.4% 
of the complete Jab < 22.5 magnitude-limited parent sample over 
the same area. These data are publicly available through the ESO 
Science Data Archive site 2 . 



2.2 Mock galaxy surveys 

In addition to the observed data, in this analysis we also make use 
of a set of 24 mock realisations of the zCOSMOS survey, con- 
structed combining the Millennium Run N-body simulation 3 , with 
a semi-analytical recipe of galaxy formation (De Lucia & Blaizot 
2007). The Millennium Run is a large dark matter N-body simula- 
tion that follows the hierarchical evolution of 2160 3 particles be- 
tween z = 127 and z — in a cubic volume of 500 3 h~ 3 Mpc 3 . 
It assumes a concordance cosmological ACDM model with 
(O m , ft Aj «b, h, n, cr 8 ) = (0.25, 0.75, 0.045, 0.73, 1, 0.9). 
The resolution of the N-body simulation, 8.6 x 10 8 h _1 MQ, cou- 
pled with the semi-analytical model allows one to resolve with a 
minimum of 100 particles halos containing galaxies with a lumi- 
nosity of 0.1L* (see Springel et al. 2005). Galaxies are generated 
inside these dark matter halos using the semi-analytic model of 
Croton et al. (2006), as improved by De Lucia & Blaizot (2007). 
This model includes the physical processes and requirements orig- 
inally introduced by White & Frenk (1991) and refined by Kauff- 
mann & Haehnelt (2000), Springel et al. (2001), De Lucia et al. 
(2004), and Springel et al. (2005). Twenty-four mocks are created, 
and then "observed" as to reproduce the zCOSMOS selection func- 
tion (lovino et al. 2010). 



2.3 Local density estimator 

To characterise galaxy environment we use the dimensionless den- 
sity contrast measured by Kovac et al. (2010) around each galaxy 
in the sample. For each galaxy at a comoving position r we com- 
pute the dimensionless 3D density contrast smoothed on a scale R, 
8 g (r,R) = (p(r,R) — p(r))/p(r), where p(r,R) is the density 
of galaxies measured on a scale R and p(r) is the overall mean 
density at r. p(r, R) is estimated around each galaxy of the sam- 
ple by counting objects within an aperture (defined either through a 
top-hat of size R or a Gaussian filter with similar dispersion). The 
reconstructed over-densities are properly corrected for the survey 
selection function and edge effects. Kovac et al. (2010) studied dif- 
ferent density estimators, corresponding to varying galaxy tracers, 
filter shapes and smoothing scales. Here we use 8 g as reconstructed 
with a Gaussian filter with dispersion R = 5h~ x Mpc. Note that 
the mass enclosed by such filter is equal to that inside a top-hat 
filter of size ~ 7.8 h -1 Mpc. We refer the reader to Kovac et al. 
(2010) for a full description of the technique. 



1 7 ~ 1.5 instead of the 7 ~ 1.8expected when approximating £(r) with 
apowerlaw (i.e. £(r) = (ro/V) 7 ) below r = 10h _1 Mpc 



http://archive.eso.org/cms/eso-data/data-packages/ 
http://www.mpa-garching.mpg.de/millennium/ 
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2.4 Expected probability distribution function of the density 
contrast 

The density contrast distribution can be predicted analytically using 
some approximations. Empirically, it has been found that the prob- 
ability distribution function (PDF) of the mass density contrast in 
real (comoving) space smoothed on a scale R is well described by 
a lognormal distribution (Coles & Jones 1991), 



Pffl = ^; /2 exp 



(ln(l+*)+c&) 



2c4 



(1) 



where w| = ln(l + (S 2 ) R ) and (S 2 ) R = cr R (z), with ctr(z) 
being the standard deviation of mass fluctuations at redshift z on 
the same scale: 



cr R (z) 



dk k 3 P(k, z) 
~k 2^ 
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Here P(k, z) is the mass power spectrum at redshift z in the 
adopted cosmology and W(x) is the Fourier transform of the 
smoothing window function. For our purpose we use the mass 
power spectrum of Smith et al. (2003), which includes the non- 
linear evolution of the initial mass fluctuations field. 

The density field recovered from redshift surveys is affected 
by galaxy peculiar motions. Therefore one needs to convert the 
real-space PDF model into redshift-space in order to be able to 
compare it to observations. It has been found that the redshift-space 
PDF of the density contrast is still well described by a lognormal 
distribution (Sigad et al. 2000), with a standard deviation o R (£) 
related to that in real space as (Kaiser 1987), 

*Uz)=(l + lf(z) + ±f 2 (z)y an{z) . (3) 

Here a R and a R are respectively the real- and redshift-space stan- 
dard deviations and f(z) denotes the growth rate of structure, 
which in the framework of General Relativity is well approximated 
as f(z) ~ n m 55 (z) (Wang & Steinhardt 1998). 

Following this procedure we obtain the PDF of the mass den- 
sity contrast in redshift-space. To obtain that of galaxies we have 
to further apply a biasing factor. For our purposes here, we simply 
assume linear deterministic biasing, setting S g = b8 (however, see 
Marinoni et al. 2005). We choose a value b = V2.05, as required to 
match the large-scale amplitude of the two-point correlation func- 
tion of galaxies in our sample, as we shall show in Sec. 3. 

By definition, the PDF described by equation 1 refers to the 
distribution of 5 as measured in randomly placed spheres within 
the survey volume. On the other hand, for the data we have at our 
disposal only the conditional values of S g as measured in volumes 
centred on each galaxy in the sample. Given the probabilistic mean- 
ing of the distribution function of the density V (p) in the two cases 
they must be related as 



v4p) = r f {p) = %v{p) 
pp(p)d P 



(4) 



Being P(S) = pP(p), the corresponding relation between the 
PDFs of the density contrast P(S) is 



Pc(3) = 



(1 + 5)P(5) 
1 + S 



(5) 



where 1 + 5 = 1. 

We are then in the position to compare the theoretically pre- 
dicted P c (normalised to the total number of galaxies in the sam- 



ple) to the observed distribution. This is presented in Fig. 1, to- 
gether with the mean and scatter (68% confidence corridor) of the 
24 mock samples. The analytical prediction and the mean of the 
mocks are in fair agreement (although they disagree in the details 
at the la level). Note, however, that the detailed shape and ampli- 
tude of the analytical prediction are quite sensitive to the choice of 
the effective redshift z of the survey. Here we have used the mean 
value z ~ 0.56 yielded by the actual redshift distribution dN/dz 
of the survey, but using e.g. z ~ 0.6 would give a better agreement 
with the PDF from the simulations. Additionally, the analytical pre- 
diction cannot include the small scale "Finger-of-God" effect due 
to high velocities in clusters (which however has the effect to re- 
duce power on small scales). Finally, it has been computed using 
the more up-to-date as = 0.8, to check the impact of the value 
as = 0.9 used for the Millennium Run. Beyond these points, the 
simple goal of the analytical model is to show an alternative - yet 
more idealised - example, in addition to the mocks, of what one 
should expect from the theory. What is relevant for this work is 
that the conditional PDF of the data differs strongly from both the- 
oretical predictions. Peaking at 5 g ~ —0.2, it shows an extended 
high-density tail out to S g ~ 7. The distribution expected from the 
models is more peaked around 5 g = and drops more rapidly for 
8 g > 2. This plot clearly shows a statistically significant excess of 
high-density regions in the galaxy data. 4 



2.5 Clustering estimation 

We estimate real-space galaxy clustering using the standard pro- 
jected two-point correlation function, w p (r p ), that properly cor- 
rects for redshift-space distortions due to galaxy peculiar motions. 
This is obtained by projecting the two-dimensional two-point cor- 
relation function £(r p , n) along the line-of-sight 5 



w p (r p ) 



2 / £(r p ,7r)d7T 
'o 



(6) 



where r p and n are the components of the galaxy-galaxy separa- 
tion vector respectively perpendicular and parallel to the line-of- 
sight (Peebles 1980; Fisher et al. 1994). £(r p , n) is measured using 
the Landy & Szalay (1993) estimator and properly accounting for 
the survey selection function and various incompleteness effects, as 
thoroughly described in de la Torre et al. (2009). Error bars are esti- 
mated through the blockwise bootstrap method (e.g. Porciani & Gi- 
avalisco 2002; Norberg et al. 2009). This is discussed in detail and 
compared to results from mock samples in Meneux et al. (2009) 
and Porciani et al. (2010). All clustering codes and methods used 
here have been extensively tested against independent programs in 
the course of the latter analyses. 



4 An ongoing analysis of the fuller COSMOS sample to Iab = 24 based 
on photometric redshifts (Scoville et al. 2010, in preparation), seems to in- 
dicate a better agreement of the observed PDF to that of the Millennium 
mocks. This might be explained as a consequence of the larger volume 
of the sample used (lower cosmic variance), together with the fairly large 
smoothing window used to define the over-densities and, most importantly, 
the blurring of the PDF produced by the photometric redshift errors. 

5 In practice, a finite value for the the upper integration limit is adopted. We 
use 20 h~ 1 Mpc which recovers the signal dispersed by redshift distortions, 
while minimising the noise that dominates at large values of n (see also 
Meneux et al. 2009; Porciani et al. 2010). 
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Figure 1. The probability distribution function of the density contrast, mea- 
sured around each galaxy in the current zCOSMOS 10k catalogue as dis- 
cussed in the text (histogram). The solid line and shaded area correspond 
respectively to the mean and ltr dispersion of the same statistics, measured 
on the 24 Millennium mocks; the dashed curve gives instead, as reference, 
the expected theoretical distribution for a lognormal model in a ACDM 
cosmology with (f! m , f^A, erg) = (0.25, 0.75, 0.8), at the mean redshift 
of the 10k sample, computed as discussed in the text. The vertical solid lines 
correspond to values of the density contrast excluding the top 5%, 10% and 
15% of the distribution. 



3 RESULTS AND DISCUSSION 

In Fig. 2 we show the projected correlation function w p (r p ) com- 
puted for the 10k sample in the redshift range 0.6 < z < 1 (top 
curve), together with those from a series of sub-samples in which 
we gradually eliminated galaxies located in the most dense envi- 
ronments. We excluded, respectively, the top 5%, 10%, and 15% 
fractions of the distribution of over-densities, corresponding to the 
dashed vertical lines in Fig. 1. w p (r p ) for the full 10k sample shows 
a very flat shape, with significant "excess clustering" above 1 h~ 
Mpc, as seen in previous analyses of the COSMOS/zCOSMOS 
data. When galaxies in the densest environments are excluded, 
however, the large-scale "shoulder" gradually disappears. What we 
see is a clear dependence of the mean large-scale clustering of 
galaxies on the type of environments they inhabit, similarly to the 
results of Abbas & Sheth (2007) from the SDSS. 

In the same figure we also show the "universal" w p (r p ) ex- 
pected in the standard ACDM cosmological model with linear bias- 
ing. The theory predictions are, again, obtained in two ways. First, 
we use HALOF1T (Smith et al. 2003) to compute directly the ap- 
proximated non-linear mass power spectrum expected at the sur- 
vey mean redshift. Secondly, we compute the average and scatter 
of w p (r p ) from the 24 mock samples. Remarkably, the two curves 
(solid and dashed black lines) are virtually indistinguishable above 
1 h Mpc once the HALO FIT mass correlation function is prop- 
erly multiplied by an arbitrary linear bias factor of b 2 = 2.05. The 
comparison to the data shows a very good agreement for the 10k 
sub-sample in which the 10% densest environments were excluded. 
We note that the shape of w p (r p ) measured from the independent 



Figure 2. The projected two-point correlation function w p (r p ) of the 
zCOSMOS 10k at 0.6 < z < 1, compared to sub-samples in which galax- 
ies living in the densest environments are gradually excluded (top to bot- 
tom). To reduce confusion, error bars are shown for the main sample only, 
being in general of amplitude comparable to the scatter of the mock samples 
indicated by the shaded area. The thick solid line and surrounding shaded 
corridor correspond in fact to the mean and 1<t scatter of the 24 mock sur- 
veys. For comparison, the dashed curve also shows the HALOFIT (Smith 
et al. 2003) analytic prescription for the non-linear mass power spectrum 
(assuming ACDM with (Q m , fl A , ct 8 ) = (0.25, 0.75, 0.8)), multiplied 
by an arbitrary linear bias b 2 = 2.05. The shape of w p (r p ) for zCOSMOS 
galaxies agrees with the models when the 10% densest environments are 
eliminated. 



VVDS survey shows a shape which is closer to the model predic- 
tions (Meneux et al. 2009). With this thresholding in density of 
the 10k data, therefore, we are able to bring the measured shape 
of galaxy clustering at z ~ 0.7 from zCOSMOS, VVDS and the 
standard cosmological model within close agreement, suggesting 
a more quantitative interpretation of the flat shape of w p (r p ) ob- 
served in zCOSMOS at these redshifts. In previous papers (e.g. 
Meneux et al. 2009; Kovac et al. 2009) we already suggested that 
this could be due to the presence of particularly significant large- 
scale structures between z = 0.5 and z — 1. Here we see that it 
is in fact driven by an excess of galaxies sampling high-density re- 
gions, skewing the density distribution away from the supposedly 
"Universal". Fig. 3 shows where these high-density galaxies are 
actually located within the 10k sample. The galaxies belonging to 
the 10% high-density tail are marked by (red) circles and turn out 
to belong to a few very well-defined structures only. 6 It is easy to 
imagine that if embedded in a larger volume, these structures would 
not weight so much as to modify significantly the overall shape of 
the PDF. As seen from the histogram in Fig. 1, however, in this vol- 



B We also directly tested whether the two-point correlation functions com- 
puted on the density-thresholded samples were by any means sensitive 
to the way the "depleted" sub-volumes were treated (e.g. kept in or ex- 
cluded when building the standard reference random sample); no significant 
changes were found. 
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Sheth 2007) that the two results are in fact consistent with each 
other (Ravi Sheth, private communication). 

These results highlight the importance in redshift surveys of 
an accurate reconstruction of the density field, to evidence possible 
peculiarities in the overall PDF as sampled by that specific cata- 
logue. Further strengthening the results obtained by Abbas & Sheth 
( 07) at z ~ 0, we have shown that an anomalous density distribu- 
tion function can significantly bias the recovered two-point correla- 
tion function, making it difficult to draw general conclusions from 
its shape. This result provides another example of the intrinsic dif- 
ficulty existing when comparing observations of the galaxy distri- 
bution to theoretical predictions. The theory provides us with fairly 
accurate forecasts for the distribution of the dark matter and for that 
of the halos within which we believe galaxies form (e.g. Mo et al. 
1996; Sheth & Tormen 1999). However, translating galaxy cluster- 
ing measurements into constraints for the halo clustering involves 
understanding how the selected galaxies populate halos with differ- 
ent mass. The result presented here show how a sample particularly 
rich in dense structures favours higher-mass halos, which in turn are 
more clustered, thus biasing the observed correlation function as a 
function of scale. A more detailed analysis of the environmental 
dependence of galaxy clustering in the zCOSMOS-Bright sample 
and related HOD modelling will be presented in a future paper. 



X-Y aspect ratio 1:1 



X-Y aspect ratio 1:4 



Figure 3. The spatial distribution of galaxies with 0.6 < z < 1 (dots) in 
the zCOSMOS 10k sample, highlighting those inhabiting the 10% highest- 
density tail of the distribution (circles). These galaxies clearly belong to 
a few well-defined structures. The right-hand plot is simply an expanded 
version of the pencil-beam on the left, to enhance visibility. 



ume there is a clear over-abundance of high-density galaxy environ- 
ments, while regions with average density are under-represented. 

One may wonder however, whether the theoretical model rep- 
resented by the Millennium mocks can be taken as a reliable ref- 
erence. In fact, it has been shown that this specific model tends to 
overestimate the overall amplitude of w p (r p ) at z ~ 1 and does 
not reproduce the observed clustering segregation in colour (e.g. 
Coil et al. 2008; de la Torre et al., in preparation) In general, semi- 
analytical recipes do tend to affect the amplitude and shape of the 
correlation function. This however happens only on small scales, 
where the complex interplay between galaxy formation processes 
and the distribution of dark-matter halos has an impact. On large 
scales instead, they predict a fairly linear biasing, as we can see 
directly comparing the solid and dashed black lines in Fig. 2. This 
means that the large-scale shape of the correlation function is es- 
sentially driven by the underlying mass distribution in the assumed 
cosmological model and not by the details of the semi-analytic 
recipe adopted to generate galaxies. A different recipe would not af- 
fect, therefore, the results obtained here, unless we postulate the ex- 
istence of dramatically non-local galaxy formation processes (e.g. 
Narayanan et al. 2000). 

We also note from Fig. 2 that the dependence of w p (r p ) on 
the PDF threshold is essentially on large scales. Below ~ lh _1 
Mpc there is no significant change when denser and denser environ- 
ments are excluded. In their analysis of the SDSS Abbas & Sheth 
(2007) consider sub-samples defined as extrema of the density dis- 
tribution, i.e. using galaxies lying on the tails of the distribution on 
both sides. With this selection, they find a change in w p (r p ) for 
different environments also on small scales. It can be shown sim- 
ply using the conservation of galaxy pairs (see Eq. 1 of Abbas & 
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